Speech LSF quantization with rate independent complexity, bit scalability and learning
نویسندگان
چکیده
A computationally efficient, high quality, vector quantization scheme based on a parametric probability density function (PDF) is proposed. In this scheme, speech line spectral frequencies (LSF) are modeled as i.i.d realizations of a multivariate Gaussian mixture density. The mixture model parameters are efficiently estimated using the Expectation Maximization (EM) algorithm. An efficient quantization scheme using transform coding and bit allocation techniques which allows for easy and computationally efficient mapping from observation to quantized value is developed for both fixed rate and variable rate systems. An attractive feature of this method is that source encoding using the resultant codebook involves very few searches and its computational complexity is minimal and independent of the rate of the system. Furthermore, the proposed scheme is bit scalable and can switch between memoryless and quantizer with memory seamlessly. The performance of the memoryless quantizer is 2-3 bits better than conventional quantization schemes.
منابع مشابه
Improved Modeling and Quantization Methods for Speech Coding
With the advent of 3G Wireless standards and subsequent bandwidth expansion, there is a clear need to design high quality, low complexity compression schemes which are bit-efficient. We have proposed a computationally efficient, high quality, vector quantization scheme based on a parametric probability density function (PDF). In this scheme, speech line spectral frequencies (LSF) are modeled as...
متن کاملClassified Vector Quantization of Lpc Parameters
To achieve high coding efficiency, modern speech coders adopt hybrid coding approaches, which utilize different coding mechanisms for various classified speech segments. With known voiced/unvoiced detection, in this paper, a classified LPC quantization (CLPQ) scheme is presented to effectively encode line spectral frequencies (LSF). The proposed CLPQ scheme improves the performance of the class...
متن کاملIntra-frame and Inter-frame Coding of Speech LSF Parameters Using A Trellis Structure
Linear Predictive Coding (LPC) parameters are widely used in various speech processing applications for representation of the spectral envelope of speech. Low bit-rate speech coding applications, require accurate quantization of these parameters using as few bits as possible. Line Spectral Frequency (LSF) representation is the most widely accepted representation of LPC parameters for quantizati...
متن کاملNormalized two stage SVQ for minimum complexity wide-band LSF quantization
We develop a two stage split vector quantization method with optimum bit allocation, for achieving minimum computational complexity. This also results in much lower memory requirement than the recently proposed switched split vector quantization method. To improve the rate-distortion performance further, a region specific normalization is introduced, which results in 1 bit/vector improvement ov...
متن کاملRobust speech mode based LSF vector quantization for low bit rate coders
Robust vector quantization of LSF parameters at a low bit rate is essential for voice coders operating below 5 Kbps. A novel aspect of the proposed technique is the use of decorrelated residual LSF vectors from speech mode based backward prediction along with a multi-stage VQ design. Rates as low as 12 bits per 20 ms speech frame for the stationary voiced speech mode and 22 bits/frame for unvoi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001